Web-Scale Semantic Product Search with Large Language Models

نویسندگان

چکیده

Abstract Dense embedding-based semantic matching is widely used in e-commerce product search to address the shortcomings of lexical such as sensitivity spelling variants. The recent advances BERT-like language model encoders, have however, not found their way realtime due strict inference latency requirement imposed on websites. While bi-encoder BERT architectures enable fast approximate nearest neighbor search, training them effectively query-product data remains a challenge instabilities and persistent generalization gap with cross-encoders. In this work, we propose four-stage procedure leverage large models for while preserving low latency. We introduce interaction pre-finetuning pretrain bi-encoders improve generalization. Through offline experiments an dataset, show that distilled small BERT-based (75M params) trained using our approach improves relevance metric by up 23% over baseline DSSM-based similar only suffers 3% drop compared 20x larger teacher. also online A/B tests at scale, production exact substitute products retrieved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semantic Constraint and QoS-Aware Large-Scale Web Service Composition

Service-oriented architecture facilitates the running time of interactions by using business integration on the networks. Currently, web services are considered as the best option to provide Internet services. Due to an increasing number of Web users and the complexity of users’ queries, simple and atomic services are not able to meet the needs of users; and to provide complex services, it requ...

متن کامل

SWSNL: Semantic Web Search Using Natural Language

As modern search engines are approaching the ability to deal with queries expressed in natural language, full support of natural language interfaces seems to be the next step in the development of future systems. The vision is that of users being able to tell a computer what they would like to find, using any number of sentences and as many details as requested. In this article we describe our ...

متن کامل

KAON - Towards a Large Scale Semantic Web

The Semantic Web will bring structure to the content of Web pages, being an extension of the current Web, in which information is given a welldefined meaning. Especially within e-commerce applications, Semantic Web technologies in the form of ontologies and metadata are becoming increasingly prevalent and important. This paper introduce KAON the Karlsruhe Ontology and Semantic Web Tool Suite. K...

متن کامل

Design Alternatives for Large - Scale Web Search :

Indexing the Web and meeting the throughput, responsetime, and failure-resilience requirements of a search engine requires massive storage and computational resources and a careful system design for scalability. This is exemplified by the big data centers of the leading commercial search engines. Various proposals and debates have appeared in the literature as to whether Web indexes can be impl...

متن کامل

Web Semantic Search with TUCUXI

Traditional search engines rely on keywords to locate Web documents that best fit a user’s query. Since words extracted from their context do not always capture the intended meaning, the relevance of the retrieved documents is affected by the natural language ambiguity. TUCUXI is a semantic search tool that replaces keywords with an ontology-based expression of the user’s requests. TUCUXI judge...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2023

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-33380-4_6